Handsfree System For Use In a Vehicle

Description 

The invention is directed to a handsfree system for use in a vehicle comprising a 
microphone array with at least two microphones and a signal processing means. 

In WO 00/30264, a method of processing signals received from an array of sen- 
sors is disclosed, the processing including filtering the signals using a first and a 
second adaptive filter. 

A system for discerning an audible command from ambient noise in a vehicular 
cabin is known from US 2002/0031234. The prior art system disclosed in this 
document includes a microphone array. Each of the microphones is coupled to a 
delay and weighting circuitry. The outputs of this circuitry are fed to a signal 
processor either directly or after being summed. According to the teaching of this 
document, the signal processor performs delay and sum processing, Griffiths-Jim 
processing, Frost processing, adaptive beamforming and/or adaptive noise reduc- 
tion. 


In other words, the signal processing functions mentioned in both prior art
documents, except for the delay and sum processing, are adaptive methods. This
means that the processing parameters, such as the filter coefficients, are
permanently adapted during operation of the system. Adaptive processing methods
are costly to implement and require a large amount of memory and computing power.
The delay and sum processing, on the other hand, shows a poor directional
characteristic, in particular, for low frequencies.

Therefore, it is the problem underlying the invention to overcome the
above-mentioned problems and to provide a handsfree system for use in a vehicle
that has good acoustic properties, in particular, a good Signal-To-Noise-Ratio
(SNR) and a good directional characteristic, and that is not too costly to
implement.


This problem is solved by the handsfree system according to claim 1.
Accordingly, the invention provides a handsfree system for use in a vehicle
comprising a microphone array with at least two microphones and a signal
processing means wherein the signal processing means comprises a superdirective
beamformer with fixed superdirective filters.

Surprisingly, such a handsfree system shows excellent acoustic performance in
a vehicular environment. In particular, speech signals are enhanced and ambient
noise is reduced. Furthermore, due to the non-adaptive beamforming with fixed
superdirective filters, the computing power required during operation is reduced.

According to a preferred embodiment, the beamformer can be a regularized
superdirective beamformer using a finite regularization parameter $\mu$. The
regularization parameter usually enters the equation for computing the filter
coefficients or, alternatively, is inserted into the cross-power spectrum matrix
or the coherence matrix. In contrast to the maximum superdirective beamformer
($\mu = 0$), the regularized superdirective beamformer has reduced noise and is
less sensitive to an imperfect matching of the microphones.

Preferably, the finite regularization parameter $\mu$ can depend on the
frequency. This achieves an improved gain of the array compared to a regularized
superdirective beamformer with fixed regularization parameter $\mu$.

According to a preferred embodiment, each superdirective filter can result from 
an iterative design based on a predetermined maximum susceptibility. This allows 
an optimal adjustment of the microphones particularly with respect to the transfer 
function and the position of each microphone. 

By using a predetermined maximum susceptibility, defective parameters of the
microphone array can be taken into account to further improve the gain. The
maximum susceptibility can be determined as a function of the error in the
transfer characteristic of the microphones, the error in the microphone positions
and a predetermined (required) maximum deviation in the directional diagram of
the microphone array. The time-invariant impulse response of the filters is
determined iteratively only once; there is no adaptation of the filter
coefficients during operation.

According to a preferred embodiment, each superdirective filter can be a filter in
the time domain. Filtering in the frequency domain is a possible alternative; it
requires, however, a Fourier transform (FFT) and an inverse Fourier transform
(IFFT) and thus increases the required memory.

According to a preferred embodiment, the signal processing means can further 
comprise at least one inverse filter for adjusting a microphone transfer function. 
In this way, conventional microphones can be used for a microphone array by 
matching the microphones using the inverse filters. Alternatively or additionally, 
matched microphones on the basis of silicon or paired microphones can be
used. 


Preferably, each inverse filter is a warped inverse filter. 

The susceptibility of microphone arrays increases with decreasing frequency. Due
to this, a higher matching precision is necessary for low frequencies than for
high frequencies. A frequency-dependent adjustment of the microphone transfer
functions with the use of warped filters reduces the required memory compared to
the case of conventional FIR filters.


Preferably, each inverse filter can be an approximate inverse of a non-minimum 
phase filter. This results in an inverse filter which is both stable and has no phase 
error. 

According to a preferred embodiment, each inverse filter can be combined with a 
superdirective filter of the beamformer. Such a coupling of the filters results in a 
simplified implementation. 

According to a preferred embodiment, the beamformer can have the structure of a
Generalized Sidelobe Canceller (GSC). In this way, at least one filter can be
saved. The implementation in the GSC structure is only possible in the frequency
domain.

In order to obtain an optimal adaptation of the handsfree system to a particular
noise situation, according to a preferred embodiment, the beamformer can be a
Minimum Variance Distortionless Response (MVDR) beamformer.

According to a preferred embodiment, at least two microphones are arranged in 
endfire orientation with respect to a first position. An array in endfire orientation 
has a better directivity and is less sensitive to a mismatched propagation or tran- 
sit time compensation. The first position can be the location of the driver's head, 
for example. 

According to a preferred embodiment, the microphone array comprises at least 
two microphones being arranged in endfire orientation with respect to a second 
position. Thus, the handsfree system of the invention has a good directivity in two 
directions. Speech signals coming from two different positions, for example, from 
the driver and the front seat passenger, can both be recorded in good quality. 

Preferably, the at least two microphones in the first endfire orientation (endfire 
orientation with respect to a first position) and the at least two microphones in the 
second endfire orientation (endfire orientation with respect to a second position) 
can have a microphone in common. In this way, a microphone array consisting of
only three microphones can already provide an excellent directivity for use in a
vehicular environment.

According to a preferred embodiment, the microphone array can comprise at least 
two subarrays. Each subarray can be optimized for a specific frequency band 
yielding an improved overall directivity. 

To decrease the total number of microphones, preferably, at least two subarrays 
can have at least one microphone in common. 

According to a preferred embodiment, the handsfree system can comprise a
frame wherein each microphone of the microphone array is arranged in a
predetermined, preferably fixed, position in or on the frame. This ensures that
after manufacture of the frame with the microphones, the relative positions of
the microphones are known. Such an array can be easily mounted in a vehicular
cabin.

According to a preferred embodiment, at least one microphone can be a direc- 
tional microphone. The use of directional microphones improves the array gain. 

Preferably, at least one directional microphone can have a cardioid characteristic. 
This further improves the array gain. More preferred, the cardioid characteristic is 
a hypercardioid characteristic. 


According to a preferred embodiment, at least one directional microphone can be a
differential microphone. This results in a microphone array with excellent
directivity and small dimensions. In particular, the differential microphone can
be a first order differential microphone.

The invention is further directed to a vehicle, in particular, a car, comprising any 
of the above described handsfree systems. 

The invention is also directed to the use of any of the previously described 
handsfree systems in a vehicle. 

Additional features and advantages of the invention will be described with
reference to the drawings:

Figure 1     shows the structure of a beamformer in the frequency domain with
             four microphones;

Figure 2     illustrates an FXLMS algorithm;

Figure 3     shows a realization of beamforming filters in the time domain;

Figure 4A    illustrates a preferred embodiment of an arrangement of a
             microphone array in a vehicle;

Figure 4B    illustrates another preferred embodiment of an arrangement of
             a microphone array in a vehicle;

Figure 5A    illustrates a preferred embodiment of an arrangement of a
             microphone array in a mirror;

Figure 5B    illustrates another preferred embodiment of an arrangement of
             a microphone array in a mirror;

Figure 6     shows a microphone array consisting of three subarrays;

Figure 7     illustrates a superdirective beamformer in GSC structure;

Figure 8     illustrates a microphone array with two microphones in a noise
             field with a noise-free sector; and

Figure 9     shows a superdirective beamformer comprising four first order
             gradient microphones.



The structure of a superdirective beamformer is shown in Figure 1. The array
consists of M microphones 1, each yielding a signal $x_i(t)$. The beamformer shown
in this figure performs the filtering in the frequency domain. Therefore, the
signals $x_i(t)$ are transferred to the frequency domain by a fast Fourier
transform (FFT) 2, resulting in signals $X_i(\omega)$. In general, the beamforming
consists of a beamsteering and a filtering. The beamsteering is responsible for
the propagation time compensation. The beamsteering is performed by the steering
vector

$$d(\omega) = \left[a_0 e^{-j 2\pi f \tau_0},\; a_1 e^{-j 2\pi f \tau_1},\; \ldots,\; a_{M-1} e^{-j 2\pi f \tau_{M-1}}\right]$$

with

$$a_n = \frac{\lVert q - p_{ref} \rVert}{\lVert q - p_n \rVert}$$

and

$$\tau_n = \frac{\lVert q - p_{ref} \rVert - \lVert q - p_n \rVert}{c}$$

wherein $p_{ref}$ denotes the position of a reference microphone, $p_n$ the
position of microphone n, $q$ the position of the source of sound (for example,
the speaker), $f$ the frequency and $c$ the velocity of sound. In the far field,
one has

$$a_0 = a_1 = \cdots = a_{M-1} = 1$$
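The steering vector can be illustrated with a short Python sketch. It assumes the definitions $a_n = \lVert q - p_{ref} \rVert / \lVert q - p_n \rVert$ and $\tau_n = (\lVert q - p_{ref} \rVert - \lVert q - p_n \rVert)/c$; the function name, the coordinate tuples and the default sound velocity of 343 m/s are illustrative assumptions, not part of the description.

```python
import cmath
import math


def steering_vector(mic_positions, ref_position, source_position, freq_hz, c=343.0):
    """Return the entries a_n * exp(-j*2*pi*f*tau_n), one per microphone.

    Positions are coordinate tuples in metres (hypothetical layout).
    """
    dist_ref = math.dist(source_position, ref_position)
    vector = []
    for p_n in mic_positions:
        dist_n = math.dist(source_position, p_n)
        a_n = dist_ref / dist_n            # amplitude factor relative to the reference
        tau_n = (dist_ref - dist_n) / c    # transit time difference relative to the reference
        vector.append(a_n * cmath.exp(-2j * math.pi * freq_hz * tau_n))
    return vector


# Two microphones 5 cm apart, source 50 cm away on the array axis (endfire).
mics = [(0.0, 0.0), (0.05, 0.0)]
d_near = steering_vector(mics, mics[0], (-0.5, 0.0), 1000.0)
# A very distant source approximates the far field, where all a_n tend to 1.
d_far = steering_vector(mics, mics[0], (0.0, 1000.0), 1000.0)
```

For the reference microphone itself the entry is exactly 1; for a distant source all magnitudes approach 1, which is the far field case stated above.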



According to a rule of thumb, one has the far field situation if the source of the
useful signal is more than twice as far from the microphone array as the maximum
dimension of the array. In Figure 1, a far field beamformer is shown since only a
phase factor $e^{j\omega\tau_k}$ denoted by reference sign 3 is applied to the
signals.

After the beamsteering, the signals are filtered by the filters 4. The filtered
signals are summed yielding a signal $Y(\omega)$. After an inverse fast Fourier
transform (IFFT), the resulting signal $y[k]$ is obtained.

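The optimal filter coefficients $A_i(\omega)$, collected in the vector
$A(\omega)$, can be computed according to

$$A(\omega) = \frac{\Gamma(\omega)^{-1} d(\omega)}{d(\omega)^H \Gamma(\omega)^{-1} d(\omega)}$$

wherein the superscript H denotes Hermitian transposing and $\Gamma(\omega)$ is
the complex coherence matrix

$$\Gamma(\omega) = \begin{pmatrix}
1 & \Gamma_{X_1 X_2}(\omega) & \cdots & \Gamma_{X_1 X_M}(\omega) \\
\Gamma_{X_2 X_1}(\omega) & 1 & \cdots & \Gamma_{X_2 X_M}(\omega) \\
\vdots & \vdots & \ddots & \vdots \\
\Gamma_{X_M X_1}(\omega) & \Gamma_{X_M X_2}(\omega) & \cdots & 1
\end{pmatrix}$$

the entries of which are the coherence functions that are defined as the
normalized cross-power spectral density of two signals

$$\Gamma_{X_i X_j}(\omega) = \frac{P_{X_i X_j}(\omega)}{\sqrt{P_{X_i X_i}(\omega)\, P_{X_j X_j}(\omega)}}$$

Preferably, the beamsteering is separated from the filtering step, which reduces
the steering vector in the design equation for the filter coefficients to the
unity vector

$$d(\omega) = (1, 1, \ldots, 1)^T$$

(The superscript T denotes transposing.)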

In the case of an isotropic noise field in three dimensions (diffuse noise field),
the coherence is given by

$$\Gamma_{X_i X_j}(\omega) = \mathrm{si}\!\left(\frac{2\pi f d_{ij}}{c}\right) e^{\,j \frac{2\pi f d_{ij} \cos\Theta_0}{c}}$$

with

$$\mathrm{si}(x) = \frac{\sin x}{x}$$

and wherein $d_{ij}$ denotes the distance between microphones i and j and
$\Theta_0$ is the angle of the main receiving direction of the microphone array
or the beamformer.



The above described design rule for computing the optimal filter coefficients
$A_i(\omega)$ for a homogeneous diffuse noise field is based on the assumption
that the microphones are perfectly matched, i.e. point-like microphones having
exactly the same transfer function. In practice, therefore, a so-called
regularized filter design can be used to adjust the filter coefficients. To
achieve this, a scalar (the regularization parameter $\mu$) is added at the main
diagonal of the cross-correlation matrix. In a slightly modified version, all
elements of the coherence matrix not on the main diagonal are divided by
$(1 + \mu)$:

$$\Gamma'_{X_i X_j}(\omega) = \frac{\Gamma_{X_i X_j}(\omega)}{1 + \mu} = \frac{1}{1 + \mu}\, \mathrm{si}\!\left(\frac{2\pi f d_{ij}}{c}\right) e^{\,j \frac{2\pi f d_{ij} \cos\Theta_0}{c}} \quad \text{for } i \neq j$$

Alternatively, the regularization parameter $\mu$ can be introduced into the
equation for computing the filter coefficients:

$$A(\omega) = \frac{\left(\Gamma(\omega) + \mu I\right)^{-1} d}{d^T \left(\Gamma(\omega) + \mu I\right)^{-1} d}$$



wherein I is the unity matrix. For convenience, in the following, the second ap- 
proach where the regularization parameter is part of the filter equation will be 
discussed in more detail. It is to be understood, however, that the first approach 
is equally suitable. 
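A minimal Python sketch of the second approach follows. It evaluates $A = (\Gamma + \mu I)^{-1} d / (d^T (\Gamma + \mu I)^{-1} d)$ with $d = (1, \ldots, 1)^T$ for one frequency. The helper names, the sound velocity of 343 m/s and the example geometry are assumptions made for illustration; the coherence helper covers only the broadside case ($\cos\Theta_0 = 0$), where the diffuse-field coherence is real valued.

```python
import math


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        acc = aug[r][n] - sum(aug[r][k] * x[k] for k in range(r + 1, n))
        x[r] = acc / aug[r][r]
    return x


def si(x):
    return 1.0 if x == 0.0 else math.sin(x) / x


def diffuse_coherence_broadside(positions_m, freq_hz, c=343.0):
    """Diffuse-field coherence matrix of a linear array for cos(theta_0) = 0."""
    return [[si(2.0 * math.pi * freq_hz * abs(xi - xj) / c) for xj in positions_m]
            for xi in positions_m]


def regularized_superdirective(gamma, mu):
    """Filter vector A for one frequency, with the unity steering vector."""
    m = len(gamma)
    loaded = [[gamma[i][j] + (mu if i == j else 0.0) for j in range(m)]
              for i in range(m)]
    v = solve(loaded, [1.0] * m)   # (Gamma + mu*I)^-1 d
    denom = sum(v)                 # d^T (Gamma + mu*I)^-1 d
    return [vi / denom for vi in v]


gamma = diffuse_coherence_broadside([0.0, 0.05, 0.10], 500.0)
a_small_mu = regularized_superdirective(gamma, 0.01)
a_large_mu = regularized_superdirective(gamma, 1e6)
```

The coefficients always sum to one, so the aligned useful signal passes undistorted. For a very large $\mu$ the result tends to the uniform weights $1/M$ of a delay and sum beamformer, which illustrates how the regularization trades directivity against robustness.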

Before discussing the superdirective beamformer in more detail, some
characteristic quantities of a microphone array are to be defined. The directional
diagram or response pattern $\Psi(\omega, \Theta)$ of a microphone array
characterizes the sensitivity of the array as a function of the direction of
incidence $\Theta$ for different frequencies.

A measure to describe the directivity of an array is the so-called gain that does
not depend on the angle of incidence $\Theta$. The gain is defined as the
sensitivity of the array in the main direction of incidence with respect to the
sensitivity for omnidirectional incidence.

The Front-To-Back-Ratio (FBR) indicates the sensitivity in the front receiving
direction compared to the back.

The white noise gain (WNG) describes the ability of the array to suppress
uncorrelated noise, for example, the inherent noise of the microphones. The
inverse of the white noise gain is the susceptibility $K(\omega)$:

$$K(\omega) = \frac{1}{WNG(\omega)} = \frac{A(\omega)^H A(\omega)}{\left|A(\omega)^H d(\omega)\right|^2}$$

The susceptibility $K(\omega)$ describes the array's sensitivity to defective
parameters. It is often preferred that the susceptibility $K(\omega)$ of the array
filters does not exceed an upper bound $K_{max}(\omega)$. The selection of this
upper bound can be dependent on the relative error $\Delta^2(\omega, \Theta)$ of
the microphones and, for example, on requirements regarding the directional
diagram $\Psi(\omega, \Theta)$. The relative error $\Delta^2(\omega, \Theta)$, in
general, is the sum of the mean square error of the transfer properties of all
microphones $\varepsilon^2(\omega, \Theta)$ and the Gaussian error with zero mean
of the microphone positions $\delta^2(\omega)$.



Defective array parameters may also disturb the ideal directional diagram; the
corresponding error can be given by $\Delta^2(\omega, \Theta) K(\omega)$. If one
requires that the deviations in the directional diagram do not exceed an upper
bound of $\Delta\Psi_{max}(\omega, \Theta)$, one obtains for the maximum
susceptibility:

$$K_{max}(\omega, \Theta) = \frac{\Delta\Psi_{max}(\omega, \Theta)}{\varepsilon^2(\omega, \Theta) + \delta^2(\omega)}$$

It is to be noted that in many cases the dependence on the angle $\Theta$ can be
neglected.
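The two quantities can be evaluated with a few lines of Python. This is a sketch only: it assumes $K = A^H A / |A^H d|^2$ and $K_{max} = \Delta\Psi_{max} / (\varepsilon^2 + \delta^2)$, and all function names and numeric values are hypothetical and chosen for illustration.

```python
def susceptibility(a, d):
    """K = A^H A / |A^H d|^2 for one frequency (inverse white noise gain)."""
    num = sum(abs(ai) ** 2 for ai in a)
    den = abs(sum(complex(ai).conjugate() * di for ai, di in zip(a, d))) ** 2
    return num / den


def max_susceptibility(dpsi_max, eps_sq, delta_sq):
    """Upper bound K_max from the allowed deviation and the two error terms."""
    return dpsi_max / (eps_sq + delta_sq)


# Delay and sum weights for two microphones: low susceptibility.
k_delay_sum = susceptibility([0.5, 0.5], [1.0, 1.0])
# Strongly differential weights: same response to the useful signal, but the
# uncorrelated microphone noise is amplified heavily.
k_differential = susceptibility([10.0, -9.0], [1.0, 1.0])
# Hypothetical tolerances: 10 % allowed deviation, 1 % for each error term.
k_max = max_susceptibility(0.1, 0.01, 0.01)
```

With these illustrative numbers, the delay and sum weights stay well below the bound, whereas the differential weights exceed it and would have to be regularized.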

In practice, the error in the microphone transfer functions $\varepsilon(\omega)$
has a higher influence on the maximum susceptibility $K_{max}(\omega)$ and, thus,
also on the maximum possible gain $G(\omega)$ than the error $\delta^2(\omega)$ in
the microphone positions. In other words, the defective transfer functions are
mainly responsible for the limitation of the maximum susceptibility.

A higher mechanical precision to reduce the position deviations of the
microphones is only sensible up to a certain point since the microphones usually
are modeled as being point-like, which is not true in reality. Thus, one can fix
the positioning errors $\delta^2(\omega)$ to a specific value, even if a higher
mechanical precision could be achieved. For example, one can take
$\delta^2(\omega) = 1\%$, which is quite realistic. The error
$\varepsilon(\omega)$ can be derived from the frequency-dependent deviations of
the microphone transfer functions.

To compensate for the above-mentioned errors, inverse filters can be used to
adjust the individual microphone transfer functions to a reference transfer
function. Such a reference transfer function can be the transfer function of one
microphone out of the array or, for example, the mean of all measured transfer
functions. In the case of the first possibility, only M-1 inverse filters (M being
the number of microphones) are to be computed and implemented.

In general, the transfer functions are not minimum phase; thus, a direct inversion
would yield unstable filters. Usually, one inverts only the minimum phase part of
the transfer function (resulting in a phase error) or one inverts the ideal
(non-minimum phase) filter only approximately. In the following, the approximate
inversion with the help of an FXLMS (filtered X least mean square) or an FXNLMS
(filtered X normalized least mean square) algorithm will be described.

After the inverse filters have been computed, they can be coupled with the
superdirective filters $A_i(\omega)$ such that, in the end, only one filter per
viewing direction and microphone is to be implemented.

The FXLMS or the FXNLMS algorithm is described with reference to Figure 2.
The error signal $e[n]$ at time n is calculated according to

$$e[n] = d[n] - y[n] = \left(p^T[n]\, x[n]\right) - \left(w^T[n]\, x'[n]\right)$$

with the input signal vector

$$x[n] = \left[x[n], x[n-1], \ldots, x[n-L+1]\right]^T$$

wherein L denotes the filter length of the inverse filter $W(z)$. The filter
coefficient vector of the inverse filter has the form

$$w[n] = \left[w_0[n], w_1[n], \ldots, w_{L-1}[n]\right]^T,$$

the filter coefficient vector of the reference transfer function $P(z)$

$$p[n] = \left[p_0[n], p_1[n], \ldots, p_{L-1}[n]\right]^T$$

and the filter coefficient vector of the n-th microphone transfer function $S(z)$

$$s[n] = \left[s_0[n], s_1[n], \ldots, s_{L-1}[n]\right]^T.$$

The update of the filter coefficients $w[n]$ is performed iteratively, i.e. at
each time step n, whereby the filter coefficients $w[n]$ are computed such that
the instantaneous squared error $e^2[n]$ is minimized. This can be achieved, for
example, by using the LMS algorithm:

$$w[n+1] = w[n] + \mu\, x'[n]\, e[n]$$

or by using the NLMS algorithm

$$w[n+1] = w[n] + \frac{\mu}{x'^T[n]\, x'[n]}\, x'[n]\, e[n]$$

wherein $\mu$ characterizes the adaptation step size and

$$x'[n] = \left[x'[n], x'[n-1], \ldots, x'[n-L+1]\right]^T$$

denotes the input signal vector filtered by $S(z)$.
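The following Python sketch shows one possible FXNLMS loop for computing an approximate inverse filter offline. It assumes white noise as the training input, short FIR models for $S(z)$ and $P(z)$, and a small constant `eps` added to the normalization to avoid division by zero; these choices and all names are illustrative assumptions rather than details given in the description.

```python
import random


def fxnlms_inverse(s, p, length, n_samples=5000, mu=0.5, eps=1e-8, seed=0):
    """Adapt w so that S(z) * W(z) approximates P(z).

    s, p: FIR coefficient lists of the microphone and reference transfer functions.
    """
    rng = random.Random(seed)
    w = [0.0] * length
    x_hist = [0.0] * max(len(s), len(p))   # raw input samples, newest first
    xf_hist = [0.0] * length               # filtered-x samples, newest first
    for _ in range(n_samples):
        x_hist = [rng.gauss(0.0, 1.0)] + x_hist[:-1]
        d = sum(pk * xk for pk, xk in zip(p, x_hist))    # desired signal d[n]
        xf = sum(sk * xk for sk, xk in zip(s, x_hist))   # input filtered by S(z)
        xf_hist = [xf] + xf_hist[:-1]
        y = sum(wk * xk for wk, xk in zip(w, xf_hist))   # output of the inverse filter
        e = d - y                                        # error e[n] = d[n] - y[n]
        norm = sum(v * v for v in xf_hist) + eps
        w = [wk + mu * e * xk / norm for wk, xk in zip(w, xf_hist)]
    return w


# Hypothetical microphone response S(z) = 1 + 0.5 z^-1, reference P(z) = 1.
w = fxnlms_inverse([1.0, 0.5], [1.0], length=8)
```

For this minimum phase example the exact inverse is $1 - 0.5 z^{-1} + 0.25 z^{-2} - \ldots$, and the adapted coefficients come close to it. For a non-minimum phase $S(z)$, choosing a pure delay as $P(z)$ is a common way to obtain a stable approximate inverse.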

In general, the susceptibility increases with decreasing frequency. Thus, it is
preferred to adjust the microphone transfer functions depending on frequency, in
particular, with a high precision for low frequencies. To achieve a high precision
of the inverse filters, FIR filters, for example, have to be very long in order to
obtain a sufficient frequency resolution in the desired frequency range. This
means that the expenditure, in particular, regarding the memory, increases
rapidly. When using a reduced sampling frequency of, for example, $f_a = 8$ kHz,
the computing time does not impose a severe limitation. A suitable
frequency-dependent adaptation of the transfer functions can be achieved by using
short WFIR filters (warped filters).

One possible iterative method to design the filters $A_i(\omega)$ with
predetermined susceptibility goes as follows:

1. Set $\mu(\omega) = 1$.

2. Determine the transfer functions of the filters $A_i(\omega)$ and the resulting
   susceptibilities $K(\omega)$ according to the equations:

   $$A(\omega) = \frac{\left(\Gamma(\omega) + \mu I\right)^{-1} d}{d^T \left(\Gamma(\omega) + \mu I\right)^{-1} d}$$

   and

   $$K(\omega) = \frac{1}{WNG(\omega)} = \frac{A(\omega)^H A(\omega)}{\left|A(\omega)^H d(\omega)\right|^2}$$

3. If the susceptibility $K(\omega)$ is larger than the maximum susceptibility
   ($K(\omega) > K_{max}(\omega)$), increase $\mu$ in the following step;
   otherwise, decrease $\mu$.

4. Repeat steps 2 and 3 until the susceptibility $K(\omega)$ is sufficiently close
   to the predetermined value $K_{max}(\omega)$. The iteration is to break off if
   $\mu$ becomes smaller than a lower limit of, for example, $\mu_{min} = 10^{-8}$.
   Such a termination criterion is mainly necessary for high frequencies
   $f > c/(2 d_{mic})$.

WO 2005/004532    PCT/EP2004/007110

Of course, there are other possibilities to compute the filters $A_i(\omega)$.
For example, one can use a fixed parameter $\mu$ for all frequencies. This
simplifies the computation of the filter coefficients. It is to be noted that the
above iterative method is not used for a real time adaptation of the filter
coefficients during operation.
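The iterative design can be sketched in Python for the simplest case of two microphones in endfire orientation, where $(\Gamma + \mu I)^{-1}$ has a closed form. The sketch assumes a Hermitian coherence matrix with off-diagonal entry $\gamma = \mathrm{si}(x)\, e^{jx}$, $x = 2\pi f d_{mic}/c$, and a multiplicative step for $\mu$ that is reduced whenever the search direction changes. The step rule, the tolerance and all names are assumptions; the description only states that $\mu$ is increased or decreased.

```python
import cmath
import math


def endfire_coherence(freq_hz, d_mic_m, c=343.0):
    """Diffuse-field coherence of two time-aligned microphones, cos(theta_0) = 1."""
    x = 2.0 * math.pi * freq_hz * d_mic_m / c
    si = 1.0 if x == 0.0 else math.sin(x) / x
    return si * cmath.exp(1j * x)


def two_mic_filters(gamma12, mu):
    """Closed form of (Gamma + mu*I)^-1 d / (d^T (Gamma + mu*I)^-1 d) for M = 2."""
    denom = 2.0 * (1.0 + mu - gamma12.real)
    return [(1.0 + mu - gamma12) / denom, (1.0 + mu - gamma12.conjugate()) / denom]


def susceptibility(filters):
    """K = A^H A / |A^H d|^2 with d = (1, 1)^T."""
    num = sum(abs(a) ** 2 for a in filters)
    return num / abs(sum(a.conjugate() for a in filters)) ** 2


def design_mu(gamma12, k_max, mu_min=1e-8, tol=1e-3, max_iter=200):
    mu, step, last_dir = 1.0, 10.0, 0          # step 1: start with mu = 1
    for _ in range(max_iter):
        k = susceptibility(two_mic_filters(gamma12, mu))   # step 2
        if abs(k - k_max) <= tol * k_max:
            break                              # step 4: close enough to K_max
        direction = 1 if k > k_max else -1     # step 3: increase or decrease mu
        if last_dir and direction != last_dir:
            step = math.sqrt(step)             # refine after passing the target
        mu = mu * step if direction > 0 else mu / step
        last_dir = direction
        if mu < mu_min:                        # step 4: lower limit reached
            mu = mu_min
            break
    return mu


g_low = endfire_coherence(300.0, 0.05)
mu_low = design_mu(g_low, k_max=10.0)
mu_high = design_mu(endfire_coherence(3000.0, 0.05), k_max=10.0)
```

At 300 Hz the unregularized design would exceed the bound, so a finite $\mu$ results. At 3000 Hz the susceptibility stays below the bound for any $\mu$, and the iteration breaks off at the lower limit. The search runs once per frequency at design time, not during operation.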

A realization of the beamforming filters in the time domain is described with
reference to Figure 3. Again, signals are recorded by microphones 1. A near field
beamsteering 5 is performed using gain factors $v_k$ 51 to compensate for the
amplitude differences and time delays $\tau_k$ 52 to compensate for the transit
time differences of the microphone signals $x_k[i]$. The realization of the
superdirective beamforming is achieved using the filters (preferably, FIR filters)
$a_k(i)$ indicated by reference sign 6.

The impulse responses $a_1(i), \ldots, a_M(i)$ can be determined as follows:

1. Determine the frequency responses $A_i(\omega)$ according to the above
   equation.

2. To obtain real valued impulse responses $a_1(i), \ldots, a_M(i)$, choose the
   frequency responses above half of the sampling frequency according to
   $A_i(\omega) = A_i^*(\omega_A - \omega)$, with $\omega_A$ denoting the sampling
   angular frequency.

3. Transfer these frequency responses to the time domain using an IFFT, yielding
   the desired FIR filter coefficients $a_1(i), \ldots, a_M(i)$.

4. Apply a window function, for example, a Hamming window, to the FIR filter
   coefficients $a_1(i), \ldots, a_M(i)$.
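Steps 2 to 4 can be sketched in Python for a single filter. The sketch takes the frequency response on the bins from 0 up to half the sampling frequency, mirrors it with complex conjugation, applies a direct inverse DFT and a Hamming window. One assumption goes beyond the steps listed above: the impulse response is rotated by half its length before windowing, so that the window is centred on the main tap. The function name is hypothetical.

```python
import cmath
import math


def fir_from_half_spectrum(half_spectrum):
    """half_spectrum: A(omega_k) for k = 0 .. N/2 (N even). Returns N real taps."""
    half = len(half_spectrum) - 1
    n = 2 * half
    # Step 2: bins above half the sampling frequency, A[k] = conj(A[N - k]).
    full = list(half_spectrum) + [complex(half_spectrum[n - k]).conjugate()
                                  for k in range(half + 1, n)]
    # Step 3: inverse DFT; the imaginary parts vanish up to rounding.
    taps = [sum(full[k] * cmath.exp(2j * math.pi * k * i / n) for k in range(n)).real / n
            for i in range(n)]
    # Assumption: rotate by N/2 so that the response is centred (adds N/2 taps of delay).
    taps = taps[half:] + taps[:half]
    # Step 4: Hamming window.
    return [t * (0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)))
            for i, t in enumerate(taps)]


# A flat frequency response gives a single (windowed) tap in the centre.
h = fir_from_half_spectrum([1.0] * 9)
```

In a real design the direct DFT sum would be replaced by an FFT routine; it is written out here only to keep the sketch free of dependencies.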

As can be seen in Figure 3, in contrast to the beamforming in the frequency do- 
main as described above, the microphone signals are directly processed using 
the beamsteering 5 in the time domain. The beamsteering 5 is followed by the 
FIR filtering 6. After summing the filtered signals, a resulting enhanced signal 
y[k] is obtained. 

Depending on the distance between speaker and microphone array, on the distance
between the microphones themselves, and on the sampling frequency $f_a$, more or
less propagation or transit time between the microphone signals is to be
compensated. The following equation is to be taken into account:

$$\Delta_{max} = \frac{d_{mic}\, f_a}{c}$$

The higher the sampling frequency $f_a$ or the larger the distance between
adjacent microphones, the more transit time $\Delta_{max}$ (in taps of delay) is
to be compensated for. The number of taps also increases if the distance between
speaker and microphone array is decreased. In the near field, more transit time is
to be compensated for than in the far field. It turns out that an array in endfire
orientation is less sensitive to a defective transit time compensation
$\Delta_{max}$ than an array in broad-side orientation.
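As a small worked example, the maximum transit time in taps can be evaluated for typical values. The sketch assumes the relation $\Delta_{max} = d_{mic} f_a / c$ and a sound velocity of 343 m/s; the function name is hypothetical.

```python
def max_delay_taps(d_mic_m, fs_hz, c=343.0):
    """Largest transit time between adjacent microphones, in sample periods."""
    return d_mic_m * fs_hz / c


taps_5cm_8k = max_delay_taps(0.05, 8000.0)     # roughly 1.2 taps
taps_5cm_16k = max_delay_taps(0.05, 16000.0)   # doubles with the sampling frequency
taps_20cm_8k = max_delay_taps(0.20, 8000.0)    # grows linearly with the spacing
```

The value scales linearly with both the spacing and the sampling frequency, which is the dependence described in the paragraph above.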

In a vehicle, the average distance between the speaker, in particular, the
speaker's head, and the array is about 50 cm. Due to a movement of the head, this
distance can change by about +/- 20 cm. If a transit time error of 1 tap is
acceptable, the distance between the microphones in broad-side orientation with a
sampling frequency of $f_a = 8$ kHz should be smaller than about
$d_{mic,max}$(broad-side) = 5 cm. With the same conditions, the maximum distance
between the microphones in endfire orientation can be about
$d_{mic,max}$(endfire) = 20 cm.

On the other hand, having a distance between the microphones of about 5 cm, it
turns out that a sampling frequency of $f_a = 16$ kHz provides excellent results
for an endfire orientation whereas in broad-side orientation, only a sampling
frequency of $f_a = 8$ kHz can be used without adaptive beamsteering. In other
words, in endfire orientation, the sampling frequency or the distance between the
microphones can be chosen much higher than in the broad-side case, thus resulting
in an improved beamforming.


In this context, it is to be pointed out that the larger the distance between the mi- 
crophones, the sharper the beam, in particular, for low frequencies. A sharper 
beam at low frequencies increases the gain in this range which is important for 
vehicles where the noise is mostly a low frequency noise. 

However, the larger the microphone distance, the smaller the usable frequency
range according to the spatial sampling theorem

$$f \le \frac{c}{2\, d_{mic}}$$



A violation of this sampling theorem has the consequence that at higher frequen- 
cies, large grating lobes appear. These grating lobes, however, are very narrow 
and deteriorate the gain only slightly. The maximum microphone distance that 
can be chosen depends not only on the lower limiting frequency for the optimiza- 
tion of the directional characteristic, but also on the number of microphones and 
on the distance of the microphone array to the speaker. In general, the larger the 
number of microphones, the smaller their maximum distance in order to optimize 
the Signal-To-Noise-Ratio (SNR). For a distance between array and speaker of 
50 cm, the microphone distance, preferably, is about $d_{mic} = 40$ cm with two
microphones (M = 2) and about $d_{mic} = 20$ cm for M = 4.

A further improvement of the directivity, and, thus, of the gain, can be achieved
by using unidirectional microphones instead of omnidirectional ones; this will be
discussed in more detail below.

Figures 4A and 4B show preferred arrangements of microphone arrays in a vehi- 
cle. In general, the distance between the microphone array and the speaker 
should be as small as possible. 

According to a first embodiment (Figure 4A), each speaker 7 can have its own 
microphone array comprising at least two microphones 1. The microphone arrays 
can be provided at different locations, for example, within the headliner, dash- 
board, pillar, headrest, steering wheel, compartment door, visor or (driving) mir- 
ror. An arrangement within the roof is also a preferred possibility that is, however, 
not suitable for the case of a cabriolet. Both microphone arrays for each speaker 
are in endfire orientation. 

In an alternative embodiment (Figure 4B), one microphone array is used for two 
neighboring speakers. In both embodiments, preferably, directional microphones, 
in particular, having a cardioid characteristic, can be used. 

In the embodiment of Figure 4B, the microphone array can be mounted within the
mirror. Such a linear microphone array can be used for both the driver and the
front seat passenger. A costly mounting of the microphones in the roof can be 
avoided. Furthermore, the array can be mounted in one piece, which ensures a 
high mechanical precision. Due to the adjustment of the mirror, the array would 
always be correctly oriented. 

Figure 5A shows a top view of a (driving) mirror 11 of a car with three
microphones in two alternative arrangements. According to the first alternative,
two microphones 8 and 9 are located in the center of the mirror in endfire
orientation with respect to the driver and, preferably, have a distance of about
5 cm between each other. The microphones 9 and 10 are in endfire orientation with
respect to the front seat passenger and have a distance of about 10 cm between
each other. Since the microphone 9 is used for both arrays, a cheap handsfree
system can be provided.

All three microphones can be directional microphones, preferably having a cardi- 
oid characteristic, for example, a hypercardioid characteristic. Alternatively, mi- 
crophones 8 and 10 are directional microphones, whereas microphone 9 is an 
omnidirectional microphone which further reduces the costs. If all three micro- 
phones are directional microphones, preferably, microphones 8 and 9 are di- 
rected towards the driver. 


Due to the larger distance between microphones 9 and 10 than between micro- 
phones 8 and 9, the front seat passenger beamformer has a better SNR at low 
frequencies. 


According to an alternative embodiment, the microphone array for the driver con- 
sists of microphones 8' and 9' located at the left side of the mirror. In this case, 
the distance between this microphone array and the driver would be increased, 
thus, decreasing the performance. On the other hand, the distance between mi- 
crophone 9' and 10 would be about 20cm, which yields a better gain for the front 
seat passenger at low frequencies. 

A variant of two microphone arrays with improved precision is shown in Figure 
5B. Also in this case, all microphones can be directional microphones, micro- 
phones 8 and 9 being directed to the driver, microphones 10 and 12 being di- 
rected to a front seat passenger. In this example, the microphone array for the 
front seat passenger comprises the three microphones 9, 10 and 12, which in- 
creases the gain considerably. 

It is to be noted that these arrangements are only examples that can be varied by 
changing the position and number of the microphones. In particular, an arrange- 
ment can be optimized with regard to a specific vehicular cabin. 

Figure 6 illustrates a microphone array comprising three subarrays 13, 14, and
15, each subarray consisting of five microphones. Within each subarray 13, 14,
and 15, the microphones are equidistantly arranged. In the total array 16, the
distances are no longer equal. As can be seen in this figure, some microphones are
used for different arrays; therefore, for the total array, only 9 microphones and
not 3·5 = 15 microphones are necessary.

In this figure, it is further indicated that the different subarrays are used for
different frequency ranges. The resulting directional diagram is then built up of
the directional diagrams of each subarray for the respective frequency range. For
the special case of Figure 6, subarray 13 with $d_{mic} = 5$ cm is used for the
frequency band of 1400 - 3400 Hz, subarray 14 with $d_{mic} = 10$ cm for the
frequency band of 700 - 1400 Hz, and subarray 15 with $d_{mic} = 20$ cm for the
band of frequencies smaller than 700 Hz. A lower limit of this frequency band can
be imposed, for example, by the lowest frequency of the telephone band (the
frequencies used in telephone applications) which, presently, is 300 Hz in most
cases.
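The band allocation of Figure 6 can be checked against the spatial sampling theorem $f \le c/(2 d_{mic})$ with a few lines of Python. The sound velocity of 343 m/s, the half-open band edges and the names are assumptions made for this sketch.

```python
SPEED_OF_SOUND = 343.0  # m/s, assumed value

# (microphone spacing in m, lower band edge in Hz, upper band edge in Hz)
SUBARRAYS = [
    (0.20, 300.0, 700.0),     # subarray 15, lower edge set by the telephone band
    (0.10, 700.0, 1400.0),    # subarray 14
    (0.05, 1400.0, 3400.0),   # subarray 13
]


def alias_free_limit(d_mic_m, c=SPEED_OF_SOUND):
    """Highest frequency without grating lobes for a given spacing."""
    return c / (2.0 * d_mic_m)


def spacing_for(freq_hz):
    """Return the spacing of the subarray that handles freq_hz."""
    for d_mic, f_low, f_high in SUBARRAYS:
        if f_low <= freq_hz < f_high:
            return d_mic
    raise ValueError("frequency outside the bands covered by the subarrays")


limits = {d_mic: alias_free_limit(d_mic) for d_mic, _, _ in SUBARRAYS}
```

Under these assumptions, the limits are about 858 Hz, 1715 Hz and 3430 Hz, so each subarray is used only below its own spatial aliasing limit.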

An improved directional characteristic can be obtained if the superdirective
beamformer is designed as a generalized sidelobe canceller (GSC). In this
structure, at least one filter can be saved. Such a superdirective beamformer in
GSC structure is shown in Figure 7. The GSC structure is to be implemented in the
frequency domain; thus, an FFT 3 is applied to the incoming signals $x_k(t)$.
Before the generalized sidelobe cancelling, a time alignment using phase factors
$e^{j\omega\tau_k}$ has to be performed (in this figure, a far field beamsteering
is shown).

In Figure 7, X denotes a vector comprising all time aligned input signals X_i(ω).
A^c is a vector comprising all frequency independent filter transfer functions A_i
that are necessary to observe the constraints in viewing direction; H is the vector
of the transfer functions performing the actual superdirectivity; and B is the
so-called blocking matrix projecting the input signals in X onto the "noise plane".
The signal Y_DS(ω) denotes the output signal of the delay and sum beamformer,
Y_BM(ω) the resulting output signal of the blocking branch, Y_SD(ω) the output
signal of the superdirective beamformer, x_i(t) and X_i(ω) the input signals in the
time and frequency domain that are not yet time aligned, and Y_i(ω) the output
signals of the blocking matrix that ideally should block completely the desired or
useful signal within the input signals. The signals Y_i(ω) ideally only comprise
the noise signals.

In addition to the superdirective output signal, a GSC structure also yields a
delay and sum beamformer signal and a blocking output signal. The number of
filters that can be saved using the GSC depends on the choice of the blocking
matrix. Usually, a Walsh-Hadamard blocking matrix is preferred instead of a
Griffiths-Jim blocking matrix since more filters can be saved with a
Walsh-Hadamard blocking matrix. Unfortunately, the Walsh-Hadamard blocking matrix
can only be given for arrays consisting of M = 2^n microphones.



In principle, a blocking matrix should have the following properties:

1. It is an (M-1) × M matrix.

2. The sum of the values within one row vanishes.

3. The matrix is of rank M-1.

A Walsh-Hadamard blocking matrix for n = 2 has the following form

\[
B = \begin{bmatrix}
1 &  1 & -1 & -1 \\
1 & -1 & -1 &  1 \\
1 & -1 &  1 & -1
\end{bmatrix}
\]



According to an alternative embodiment, a blocking matrix according to
Griffiths-Jim can be used which has the general form

\[
B = \begin{bmatrix}
1 & -1 & 0 & \cdots & 0 \\
0 & 1 & -1 & \cdots & 0 \\
\vdots & & \ddots & \ddots & \vdots \\
0 & 0 & \cdots & 1 & -1
\end{bmatrix}
\]
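
Both blocking matrices can be generated and checked against the three properties listed above. The following is a minimal sketch with hypothetical function names; the Walsh-Hadamard variant uses the Sylvester construction, which is an assumption about how the matrix is built and may give the rows in a different order than printed in the text.

```python
import numpy as np


def griffiths_jim_blocking_matrix(m):
    """(M-1) x M matrix whose rows take differences of adjacent channels."""
    b = np.zeros((m - 1, m))
    idx = np.arange(m - 1)
    b[idx, idx] = 1.0
    b[idx, idx + 1] = -1.0
    return b


def walsh_hadamard_blocking_matrix(n):
    """(M-1) x M matrix for M = 2**n, built by the Sylvester construction.

    The first (all-ones) row of the Hadamard matrix is dropped because it does
    not sum to zero. Row order may differ from the matrix printed in the text;
    the three listed properties are unaffected by that.
    """
    h = np.array([[1.0]])
    for _ in range(n):
        h = np.block([[h, h], [h, -h]])
    return h[1:, :]


def is_valid_blocking_matrix(b):
    """Check shape (M-1) x M, zero row sums, and rank M-1."""
    rows, cols = b.shape
    return bool(
        rows == cols - 1
        and np.allclose(b.sum(axis=1), 0.0)
        and np.linalg.matrix_rank(b) == rows
    )
```

The Griffiths-Jim form exists for any M, whereas the Walsh-Hadamard form is only defined here for M = 2**n, matching the restriction noted in the text.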



The upper branch of the GSC structure is a delay and sum beamformer with the
transfer functions

\[
A^c = \left[ \frac{1}{M}, \frac{1}{M}, \ldots, \frac{1}{M} \right]^T
\]



The computation of the filter coefficients of a superdirective beamformer in GSC
structure is slightly different compared to the conventional superdirective
beamformer. The transfer functions H_i(ω), collected in the vector H(ω), are to be
computed as

\[
H(\omega) = \left( B \, \Phi_{NN}(\omega) \, B^H \right)^{-1} \left( B \, \Phi_{NN}(\omega) \, A^c \right),
\]

wherein B is the blocking matrix and Φ_NN(ω) the matrix of the cross-correlation
power spectrum of the noise. In the case of a homogeneous noise field, Φ_NN(ω)
can be replaced by the time aligned coherence matrix of the diffuse noise field
Γ(ω), as previously discussed.
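
As an illustration of this computation, the sketch below evaluates the filters for one frequency in the form H = (B Φ B^H)^-1 (B Φ A^c). Several points are assumptions rather than taken from the text: a spherically isotropic coherence model sin(kd)/(kd), a broadside viewing direction (so that time alignment leaves the coherence matrix unchanged), a small diagonal loading mu as a simple regularization, and an output formed as the delay and sum signal minus the filtered blocking-branch signals. All names are hypothetical.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s


def diffuse_coherence(positions_m, freq_hz, c=C):
    """Coherence matrix of a spherically isotropic noise field (assumed model).

    Gamma_ij = sin(k d_ij) / (k d_ij) with k = 2*pi*f/c; np.sinc(x) is
    sin(pi*x)/(pi*x), hence the argument 2*f*d/c.
    """
    p = np.asarray(positions_m, dtype=float)
    d = np.abs(p[:, None] - p[None, :])
    return np.sinc(2.0 * freq_hz * d / c)


def gsc_filters(b, phi_nn, a_c, mu=1e-2):
    """H = (B Phi B^H)^-1 (B Phi A^c), with diagonal loading mu on Phi."""
    phi = phi_nn + mu * np.eye(phi_nn.shape[0])
    return np.linalg.solve(b @ phi @ b.conj().T, b @ phi @ a_c)


m = 4
positions = 0.05 * np.arange(m)                # 5 cm spacing
b = np.eye(m - 1, m) - np.eye(m - 1, m, k=1)   # Griffiths-Jim blocking matrix
a_c = np.full(m, 1.0 / m)                      # delay and sum branch
gamma = diffuse_coherence(positions, 1000.0)
h = gsc_filters(b, gamma, a_c)

# Equivalent weights of the whole structure, assuming the output is formed as
# Y_DS minus the filtered blocking-branch signals.
w_gsc = a_c - b.conj().T @ h
```

With these conventions the equivalent weights coincide with the directly computed minimum-variance weights for the same loaded coherence matrix, while only M-1 filters have to be stored in the lower branch.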



A regularization and the iterative design with predetermined susceptibility can be 
performed in the same way as above. 

All previously discussed filter designs only assume that the noise field is
homogeneous and diffuse. These designs can be generalized by excluding a region
around the main receiving direction Θ_0 when determining the homogeneous noise
field. In this way, mainly the Front-To-Back-Ratio can be optimized. This is
illustrated in Figure 8 where a sector of ±δ is excluded. The computation of the
two-dimensional diffuse (cylindrically isotropic), homogeneous noise field can be
performed using the new design parameter δ:



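\[
\Gamma_{ij}(\omega, \Theta_0, \delta) = \frac{1}{2(\pi - \delta)} \int_{\Theta_0 + \delta}^{\Theta_0 - \delta + 2\pi} \exp\!\left( j \frac{\omega d_{ij} \cos\Theta}{c} \right) \exp\!\left( -j \frac{\omega d_{ij} \cos\Theta_0}{c} \right) d\Theta
\]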
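
A coherence of this kind can be evaluated numerically by averaging plane waves over all azimuths outside the excluded sector. The sketch below is an assumption-laden illustration: the sign convention of the phase terms, the time alignment towards the main receiving direction, the speed of sound, and the function name are all hypothetical choices.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s


def sector_excluded_coherence(d_ij, omega, theta0, delta, c=C, n_points=20001):
    """Time-aligned 2D diffuse coherence with the sector theta0 +/- delta removed.

    Plane waves are averaged uniformly over the remaining azimuths
    [theta0 + delta, theta0 - delta + 2*pi]; the factor exp(-j*k*d*cos(theta0))
    is the assumed time alignment towards theta0.
    """
    theta = np.linspace(theta0 + delta, theta0 - delta + 2.0 * np.pi, n_points)
    k_d = omega * d_ij / c
    integrand = np.exp(1j * k_d * (np.cos(theta) - np.cos(theta0)))
    # Trapezoidal rule written out explicitly.
    step = theta[1] - theta[0]
    integral = step * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    return integral / (2.0 * (np.pi - delta))
```

For delta = 0 and a broadside direction this reduces to the Bessel function J0(ω d_ij / c), the usual coherence of a cylindrically isotropic field, which gives a simple sanity check.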

This method can also be generalized to the three-dimensional case. Then, in
addition to the parameter δ being responsible for the azimuth, a further parameter
ρ is to be introduced for the elevation angle. This yields an analogous equation
for the coherence of the homogeneous diffuse 3D noise field.

A superdirective beamformer based on an isotropic noise field is particularly
useful for a handsfree system which is to be installed later in a vehicle. This is
the case, for example, if the handsfree system is installed in the vehicle by the
users themselves. On the other hand, an MVDR beamformer can be relevant if there
are specific noise sources at fixed relative positions or directions with respect
to the position of the microphone array. In this case, the handsfree system can be
adapted to a particular vehicular cabin by adjusting the beamformer such that its
zeros point in the direction of the specific noise sources. For example, such a
noise source can be formed by a loudspeaker or a fan. Preferably, a handsfree
system with an MVDR beamformer is already installed during manufacture of the
vehicle.

The typical distribution of noise or noise sources in a particular vehicular cabin
can be determined by performing corresponding noise measurements under
appropriate conditions (e.g., driving noise with and/or without loudspeaker and/or
fan noise). The measured data are used for the design of the beamformer. It is to
be noted that, in this case as well, no further adaptation is performed during
operation of the handsfree system.
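
A design from measured noise data could proceed along the following lines. This is a sketch, not the procedure of the text: it assumes the measurements are available as time-aligned frequency-domain snapshots for one frequency bin, uses the textbook MVDR weights w = Φ^-1 d / (d^H Φ^-1 d), and adds a small diagonal loading. The function names are hypothetical. The weights are computed once, offline, and then kept fixed.

```python
import numpy as np


def noise_psd_matrix(noise_spectra):
    """Estimate Phi_NN at one frequency from noise snapshots.

    noise_spectra has shape (snapshots, mics): one frequency bin of each
    microphone per measurement frame. Returns the averaged outer product
    E[N N^H].
    """
    n = np.asarray(noise_spectra)
    return n.T @ n.conj() / n.shape[0]


def mvdr_weights(phi_nn, steering, mu=1e-3):
    """Fixed MVDR weights w = Phi^-1 d / (d^H Phi^-1 d), with loading mu."""
    m = phi_nn.shape[0]
    load = mu * np.trace(phi_nn).real / m
    phi_inv_d = np.linalg.solve(phi_nn + load * np.eye(m), steering)
    return phi_inv_d / (steering.conj() @ phi_inv_d)
```

With the output formed as w^H X, the desired direction passes undistorted, while a dominant directional noise source contained in the measurement is strongly attenuated.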



Alternatively, if the relative position of a noise source is known, the
corresponding superdirective filter coefficients can also be determined
theoretically.

As already stated above, the use of directional microphones further improves the
signal enhancement. Figure 9 shows a superdirective beamformer with directional
microphones 17. In this figure, each directional microphone 17 is depicted by its
equivalent circuit diagram. In these circuit diagrams, d_DMA denotes the (virtual)
distance of the two omnidirectional microphones composing the first order pressure
gradient microphone in the circuit diagram. T is the (acoustic) delay line fixing
the characteristic of the directional microphone and EQ_TP is the equalizing
low-pass filter yielding a frequency independent transfer behavior in viewing
direction.

In practice, these circuits and filters can be realized purely mechanically by
taking an appropriate mechanical directional microphone. Again, the distance
between the directional microphones is d_mic. In Figure 9, the whole beamforming is
performed in the time domain. A near field beamsteering is applied to the signals
x_n[i] coming from the microphones and being filtered by the equalizing filter
EQ_TP. The gain factors v_n compensate for the amplitude differences and the delays
τ_n for the transit time differences of the signals. The FIR filters a_n[i] realize
the superdirectivity in the time domain.
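
The time-domain chain of gain, delay, FIR filter, and summation can be sketched as below. The sketch assumes integer sample delays and already equalized input signals; the function name and argument layout are hypothetical, and the filter coefficients are taken as given, fixed values.

```python
import numpy as np


def time_domain_beamformer(x, gains, delays, fir_filters):
    """Gain, delay, FIR-filter, and sum the microphone signals.

    x:           array (mics, samples), signals x_n[i] after equalization
    gains:       v_n, compensating amplitude differences
    delays:      tau_n in whole samples (assumption: integer delays; fractional
                 delays would need an interpolation filter)
    fir_filters: array (mics, taps), the fixed coefficients a_n[i]
    """
    n_mics, n_samples = x.shape
    y = np.zeros(n_samples)
    for n in range(n_mics):
        steered = np.zeros(n_samples)
        d = int(delays[n])
        steered[d:] = gains[n] * x[n, : n_samples - d]  # delay by d samples
        y += np.convolve(steered, fir_filters[n])[:n_samples]
    return y
```

For a near source, the microphone closest to the speaker receives the signal earliest and strongest, so it gets the largest delay and the smallest gain.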

Mechanical pressure gradient microphones are of high quality and yield an
excellent array gain, in particular when a hypercardioid characteristic is used.
The use of directional microphones results in an excellent Front-to-Back-Ratio as
well.

All previously discussed embodiments are not intended as limitations but serve 
as examples illustrating features and advantages of the invention. It is to be un- 
derstood that some or all of the above described features can also be combined 
in different ways. 



